Apache Sqoop
Apache Sqoop is a tool designed for efficiently transferring the data between Hadoop and Relational Databases (RDBMS). The following are a few highlights of this project:
- Sqoop can efficiently transfer bulk data between HDFS and Relational Databases.
- Sqoop allows importing the data into HDFS in an incremental fashion.
- Sqoop can import and export data to and from HDFS, Hive, Relational Databases and Data Warehouses.
- Sqoop uses MapReduce to import and export of data thereby effectively utilizing the parallelism and fault tolerance features of Hadoop.
- Sqoop offers a command line commonly referred to as Sqoop command line.
Additional Information: Home Page | Wiki | Documentation/User Guide/Reference Manual | Mailing Lists